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Abstract 

In this paper, we consider a cognitive radio (CR) network in which a secondary multiantenna base 
station (BS) attempts to communicate with multiple secondary users (SUs) using the radio frequency 
spectrum that is originally allocated to multiple primary users (PUs). Here, we employ partially-projected 
regularized zero-forcing (PP-RZF) precoding to control the amount of interference at the PUs and to 
minimize inter-SUs interference. The PP-RZF precoding partially projects the channels of the SUs into 
the null space of the channels from the secondary BS to the PUs. The regularization parameter and the 
projection control parameter are used to balance the transmissions to the PUs and the SUs. However, 
the search for the optimal parameters, which can maximize the ergodic sum-rate of the CR network, 
is a demanding process because it involves Monte-Carlo averaging. Then, we derive a deterministic 
expression for the ergodic sum-rate achieved by the PP-RZF precoding using recent advancements in 
large dimensional random matrix theory. The deterministic equivalent enables us to efficiently determine 
the two critical parameters in the PP-RZF precoding because no Monte-Carlo averaging is required. 
Several insights are also obtained through the analysis. 
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I. Introduction 

The radio frequency spectrum is a valuable but congested natural resource because it is shared 
by an increasing number of users. Cognitive radio (CR) [1-4] is viewed as an effective means to 
improve the utilization of the radio frequency spectrum by introducing dynamic spectrum access 
technology. Such technology allows secondary users (SUs, also known as CR users) to access the 
radio spectrum originally allocated to primary users (PUs). In the CR literature, two cognitive 
spectrum access models have been widely adopted [4]: 1) the opportunistic spectrum access 
model and 2) the concurrent spectrum access model. In the opportunistic spectrum access model, 
SUs carry out spectrum sensing to detect spectrum holes and reconfigure their transmission to 
operate only in the identified holes [1,5]. Meanwhile, in the concurrent spectrum access model, 
SUs transmit simultaneously with PUs as long as interference to PUs is limited [6,7]. 

In this paper, we focus on the concurrent spectrum access model particularly when the 
secondary base station (BS) is equipped with multiple antennas. A desirable condition in the 
concurrent spectrum access model is for SUs to maximize their own performance while minimiz¬ 
ing the interference caused to the PUs. Several transmit schemes have been studied to balance 
the transmissions to the SUs and the PUs [8-12]. In [8], a transmit algorithm has been proposed 
based on the singular value decomposition of the secondary channel after the projection into the 
null space of the channel from the secondary BS to the PUs. A spectrum sharing scheme has 
been designed for a large number of SUs [9], in which the SUs are pre-selected so that their 
channels are nearly orthogonal to the channels of the PUs. Doing so ensures that the SUs cause 
the lowest interference to the PUs. 

In multi-antenna and multiuser downlink systems, a common technique to mitigate the mul¬ 
tiuser interference is a zero-forcing (ZF) precoding [13-16], which is computationally more 
efficient than its non-linear alternatives. However, the achievable rates of the ZF precoding 
are severely compromised when the channel matrix is ill conditioned. Then, regularized ZF 
(RZF) precoding [17,18] is proposed to mitigate the ill-conditioned problem by employing a 
regularization parameter in the channel inversion. The regularization parameter can control the 
amount of introduced interference. Several applications based on the RZ F framework have been 
developed, such as transmitter designs for non-CR broadcast systems [19-22], security systems 
[23,24], and multi-cell cooperative systems [25-28]. 
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While directly applying RZF to CR networks, the secondary BS can only control the inter¬ 
ference in inter-SUs. A partially-projected RZF (PP-RZF) precoding has been proposed [10, 
11], which limits the interference from the SUs to the PUs by combining the RZF [17,18] with 
the channel projection idea [8]. The PP- RZ F precoding follows the classical RZF technique, 
although the former is based on the partially-projected channel, which is obtained by partially 
projecting the channel matrix into the null space of the channel from the secondary BS to the 
PUs. The amount of interference to the PUs decreases with increasing amounts of projection 
into the null space of the PUs, which can be achieved by tuning the projection control parameter. 
However, the search for the optimal regularization parameter and projection control parameter 
is a demanding process because it involves Monte-Carlo averaging. Therefore, a deterministic 
(or large-system) approximation of the signal-to-interference-plus-noise ratio (SINR) for the PP- 
RZF scheme has been derived [10,11]. Unfortunately, only the CR channel with a single PU 
has been studied and the scenario where multiple PUs are present remains unsolved [10]. 

To apply the PP- RZ F precoding scheme in a CR network with multiple PUs, a new analytical 
technique that deals with a mw/ri-dimensional random projection matrix, which is generated by 
partially projecting the channel matrix into the null spaces of multiple PUs, is required. This 
paper aims to address the above mentioned challenge by providing analytical results in a more 
general setting than that in [10,11]. Specifically, we focus on a downlink multiuser CR network 
(Fig. 1), which consists of a secondary BS with multiple antennas, SUs, and PUs as well as 
different channel gains. Our main contributions are summarized below. 

• We derive deterministic equivalents for the SINR and the ergodic sum-rate achieved by 
the PP-RZF precoding under the general CR network. Unlike previous works [10,11], our 
model considers multiple PUs and allows different channel gains from the secondary BS to 
each user. Owing to recent advancements in large dimensional random matrix theory (RMT) 
with respect to complex combinations of different types of independent random matrices 
[29], we identify the large system distribution of the Stieltjes transform for a new class of 
random matrix. Therefore, our extension becomes non trivial and novel. 

• In the PP-RZF precoding, the regularization parameter and the projection control parameter 
can regulate the amount of interference to the SUs and the PUs, but a wrong choice of 
parameters can considerably degrade the performance of the CR network. However, the 
search for the optimal parameters is a demanding process because Monte-Carlo averaging 


4 


is required. We overeome the fundamental diffieulty of applying PP-RZF preeoding in the 
CR network. The deterministie equivalent for the ergodie sum-rate provides an effieient 
way of finding the asymptotieally optimal regularization parameter and the asymptotieally 
optimal projeetion eontrol parameter. Simulation results indieate good agreement with the 
optimum in terms of the ergodie sum-rate. 

• We provide several useful observations on the eondition that the regularization parameter 
and the projeetion eontrol parameter ean aehieve the optimal sum-rate. We also reveal the 
relationship between the parameters and the signal-to-noise ratio (SNR). 

Notations —We use upperease and lowerease boldfaee letters to denote matriees and veetors, 
respeetively. An N x N identity matrix is denoted by Iat, an all-zero matrix by 0, and an all-one 
matrix by 1. The superseripts (■)^, (•)^, and (•)* denote the eonjugate transpose, transpose, 
and eonjugate operations, respeetively. E{-} returns the expeetation with respeet to all random 
variables within the braeket, and log(-) is the natural logarithm. We use [A]fcj, [A]; fc, or Aki to 
denote the entry of the matrix A, and denotes the k-th entry of the eolumn veetor 

a. The operators (-j^, (•)“^,tr(-), and det(-) represent the matrix prineipal square root, inverse, 
traee, and determinant, respeetively, || • || represents the Euelidean norm of an input veetor or 
the speetral norm of an input matrix, and diag(x) denotes a diagonal matrix with x along its 
main diagonal. The notation “ilAy” denotes the almost sure (a.s.) eonvergenee. 

II. System Model and Problem Formulation 

A. System Model 

As illustrated in Fig. 1, we eonsider a downlink multiuser CR network that eonsists of a 
seeondary BS with N antennas (labeled as BS). The BS simultaneously transmits K independent 
messages to K single antenna SUs (labeled as SUi,..., SU^'). We assume that all the SUs share 
the same speetrum with L single antenna PUs (labeled as PUi,..., PUl). Fet h.^ G be the 
fading ehannel veetor between BS and SUfc, G be the fading ehannel veetor between 
BS and PU/, and G be the preeoding veetor of SUfc. The reeeived signal at SU^ ean 

therefore be expressed as 

K 

Vk = hfgfcSfc + iikgjSj + Zk, 

j=l,jj^k 


( 1 ) 
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Fig. 1. A downlink multiuser cognitive radio network. 


where Sk is the data symbol of SU^, s/s are independent and identieally distributed (i.i.d.) data 
symbols with zero mean and unit variance, respectively, and Zk is the additive Gaussian noise 
with zero mean and variance of a^. For ease of exposition, we define H = [hi,... ,hx]^ G 
CAxa f a e C^x^ G ^ [gi,...,gi,] e y a ^ c^ 

s = [si,..., sk]^ G C^, and z = [zi ,..., zxf' G C^. The received signal of all the SUs in 
vector form is given by 

y = HGs + z. (2) 

We also assume that BS satisfies the average total transmit power constraint 

E {tr (GG^)} <iVPT, (3) 

where Pr > 0 is the parameter that determines the power budget of BS. Notably, if we consider 
the instantaneous transmit power constraint, i.e., tr(GG^) < NP^, we can obtain the same 
constraint in a large-system regime, as shown in Appendix B-III. 

The peak received interference power constraint or the average received interference power 
constraint is used to protect the PUs. Given that the latter is more flexible for dynamically 
allocating transmission powers over different fading states than the former [30,31], we employ 
the average received interference power constraint and consider two cases: Case I—the average 
received interference power constraint at each PU and Case II—the total average received 
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interference power constraint at all PUs^ These cases are respectively given by 

Case I (Per PU power constraint): E < Pi, for / = 1,..., L, (4a) 

Case II (Sum power constraint): E {tr (FGG^F^) } < Pain (4b) 

where P/ > 0 denotes the interference power threshold of PUz, and Pau > 0 represents the total 
interference power threshold of all PUs. We then set Pi = OiPt and Pau = 6*aiiPr with Oi^O^a 
being positive scalar parameters to make a connection with the transmit power. Although we 
only consider equal power allocation for simplicity in this paper, our framework can be easily 
extended to arbitrary power allocation by replacing G with GP^, where P = diag(pi,... ,pk) 
with Pfc > 0 being the signal power of SUfc (see [21,22] for a similar application). 

Next, to incorporate path loss and other large-scale fading effects, we model the channel 
vectors by 

hf = v^hf and = (5) 

where h.^ and are the small-scale (or fast) fading vectors, and ri ^ and r 2 ^i denote the large- 
scale fading coefficients (or channel path gains), including the geometric attenuation and shadow 
effect. Using the above notations, the concerned channel matrices can be rewritten as 

H = RfH and F = R|F, (6) 

where H = [-^hij] G and F = g consist of the random components of 

the channel in which hij's and /^’s are i.i.d. complex random variables with zero mean and unit 
variance, respectively, and Ri G and R 2 G are diagonal matrices whose diagonal 

elements are given by [Rij^fc = and [R 2 ];; = r 2 p respectively. In line with [10,11], we 
assume that H is perfectly known to BS in this paper. Since BS needs to predict the interference 
power in (4), we further assume that perfect knowledge of F is available at BS [10,11,32]. 
To acquire perfect channel state information (CSI) for H and F, transmission protocols need 
to incorporate certain cooperation among the PUs, the SUs, and BS [32]. Further research can 
focus on the case with imperfect CSI or estimation of channel [33,34]. 

'Notably, multiple single-antenna PUs exist. These PUs can also be considered a single equivalent PU with multiple receive 
antennas. 
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In the downlink CR network (2), we eonsider the RZF preeoding beeause this preeoding’s 
relatively low eomplexity eompared with dirty paper eoding [17,18,21,27]. However, a direet 
applieation of the eonventional RZF to the seeondary BS will result in a very ineffieient trans¬ 
mission beeause a large power back-off at the secondary BS is required to satisfy the interference 
power constraint (4). Therefore, following [10,11], we adopt the RZF precoding based on the 
partially-projected channel matrix 

H = H(I^-/3W^W), (7) 

where W = (FF^)“^F e and (3 G [0,1] is the projection control parameter. Note that 

the projected channel matrix H is obtained by partially projecting H into the null space of F. 
Specifically, the RZ F precoding matrix is given by 

G = e (H^H + alN) H^, (8) 

where ^ is a normalization parameter that fulfills the BS transmit power constraint (3) and the 
interference power constraint (4), and a > 0 represents the regularization parameter. We refer 
to this precoding as PP-RZF precoding. 

Before setting each of the parameters in (8), two special cases of the PP-RZF precoding are 
considered first. On the one hand, if /3 = 0 then G degrades to the conventional RZ F precoding. 
On the other hand, if /3 = 1 then H is completely orthogonal to F and we have FH^ = 0, i.e., 
no interference signal from the secondary BS will leak to the PUs. Therefore, the interference 
power constraint (4) is naturally guaranteed. Furthermore, the amount of the interference to the 
PUs decreases as the projection control parameter increases. 

Now we return to the setting of the normalization parameter in (8). Considering Case I, from 
(3) and (4a), we have 

<& = I, , --TTT. (9a) 

E|-itr((H"H+alA.) H"H (H"H + oIa-) j| 

e<& = -r-——. - - -^rrr- W 

E (H^H + al^) (H^H + aljv) fij 

To satisfy (3) and (4a) simultaneously, we set = min{^Q, , ( = 1,..., L}. Then, the SINR 




of secondary user SU^ is given by 


7fc = 


hf(H^H + aI^) 'h, 




£l 

e 


p 


h^(H^H + aI^) 'h 


( 10 ) 


phf (H^^H + al^) 'hJ,H[,](H^H + «I^) 'h, + z/’ 

where = [hi, • • •, h^-i, hfc+i, • • •, hi^]^ G C(^- 0 x^, 4 (i^ _/3W^W)hfc, p ^ Pr/a^ 

and 


z/ = — = max 

r 


E|^tr((H^H + «I^) 'h^H(H^H + «I^) |, 

^ E I (H^H + al^) H^H (H^H + al^) } , / = 



( 11 ) 


Here, the equality of (11) follows from (9). For Case II, we have 



Consequently, under the assumption of perfect CSI at both transmitter and receivers, the ergodic 
sum-rate of the CR network with Gaussian signaling can be defined as 

K 

Rsnm - E {log (1 -f 7fc)} . (13) 

k=l 

Note that 7 ^ in the ergodic sum-rate is subject to the BS transmit power constraint in (3) and 
the interference power constraint (to the primary users) in (4). 


B. Problem Formulation 

The SINK 7 ^, in (10) is a function of the regularization parameter a and the projection 
control parameter (3. In the literature, adopting incorrect regularization parameter would degrade 
performance significantly [18,21,27]. In light of the discussion in the previous subsection, one 
can realize that a proper projection control parameter can assist in decreasing the interference 








to the PUs. As a result, using the PP-RZF preeoding effeetively requires obtaining appropriate 
values of a and to optimize eertain performanee metries. In this paper, we are interested in 
finding (a,/3), whieh maximizes the ergodie sum-rate (13). Formally, we have 


{ } = arg max K 


'Sum* 


(14) 


o>0,l>/3>0 


The above problem does not admit a simple elosed-form solution and the solution must be 
eomputed via a two-dimensional line seareh. Monte-Carlo averaging over the ehannels is required 
to evaluate the ergodie sum-rate (13) for eaeh ehoiee of a and /), whieh, unfortunately, makes 
the overall eomputational eomplexity prohibitive. This drawbaek hinders the development of the 
PP-RZF preeoding. To address this problem, we resort to an asymptotie expression of (13) in 
the large-system regime in the next seetion. 


III. Performance Anaeysis of Large Systems 


This seetion presents the main results of the paper. First, we derive deterministie equivalents 
for the SINK 7 ^ and the ergodie sum-rate i?sum in a large-system regime. Then, we identify 
the asymptotieally optimal regularization parameter and the asymptotieally optimal projeetion 
eontrol parameter to aehieve the optimal deterministie equivalent for the ergodie sum-rate. 

A. Deterministic Equivalents for the SINK and the Ergodie Sum-Rate 

We present a deterministie equivalent for the SINK 7 ^ by eonsidering the large-system regime, 
where N, K, and L approaeh infinity, whereas 



L 

N 


are fixed ratios, sueh that 0 < liminfArCi < limsup^yCi < oo ,0 < liminfArC 2 < limsup^C 2 < 
1. For brevity, we simply use AA —)■ exo to represent the quantity in sueh limit. In addition, we 
impose the assumptions below in our derivations. 

Assumption 1: For the ehannel matriees H and G in ( 6 ), we have the following hypotheses: 

1) H = [-^hij] G where hifs are i.i.d. standard Gaussian. 

2) F = e where ffs have the same statistieal properties as hifs. 
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3) Ri = diag(ri^i,..., e and R 2 = diag(r 2 ,i,..., r 2 ,L) G are diagonal 

matrices with uniformly bounded spectral norm^ with respect to K and L, respectively. 

Based on the definition of W in (7), W^W = F^{FF^)-^F = F^(FF^)-^F = W^W, 
where W = (FF^)“^F. Therefore, W is L < rows of an iV x iV Haar-distributed unitary 
random matrix [29, Definition 4.6]. The partially-projected channel matrix H is clearly com¬ 
posed of the product of two different types of independent random matrices. Owing to recent 
advancements in large dimensional RMT [29], we arrive at the following theorem, and the details 
are given in Appendix A. 

Theorem 1: Under Assumption 1, in Case I (per PU power constraint), as AA —)■ 00 , we have 
Ik — Ik 0 , for /c = 1 ,..., iC, where 


Ik 


_p(4 

pbk + v' 


with 


(15) 


+ ^ 2(1 -/^)) 
a + U,fc(U +^ 2(1 


T _ ( (l-Ofc) ti (1 - (1 -/3)afc) (1 -/3)2f2 

Ofc — Ti^k I -:—^-r 


1 + e 


1 -f (1 - /9)2e 


de 

da 


V = max 


de 

da 


I ( h { 1 -/ 3 )H 2 \ ^ r 2 ,i (1 - de ^ 

1-|-(1 — /9)^e / da ’ 61 C 2 1 -f (1 — /3)^e da ’ 

^trRi {alK + (ti + t2(l - Pf) Ri)~^ _ 

(A + TrSSjfe) > (R. (“i/f + («. + *2(1 - PV) R.)-T ’ 



= f = C2 

^ l-fe’ ^ 1-|-(1— /3)^e’ 


(16a) 

(16b) 

(16c) 

(16d) 

(16e) 


and e is given as the unique solution to the fixed point equation 


e ——trRi {oIk + (U + ^ 2(1 ~ (^Y') Ri) 


(17) 


Meanwhile, in Case II (sum power constraint), all asymptotic expressions remain, except for v. 


^[35]: The spectral norm ||| • |||2 is defined on by |||A |||2 = max{%/A ; A is an eigenvalue of A*A}. 
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which should be changed to 

\ ( ti (1 - \ de trR 2 (1 - P)H 2 de \ 

\vl + e 1 + (1 -/3)2ey 5a’0aiiC2 1 + (1 -/9)^e 5a J 


(18) 


An intuitive application of Theorem 1 is that 7 ^ can be approximated by its deterministic 
equivalent^ 7 ^, which can be determined based only on statistical channel knowledge, that is, 
Ri, R 2 , and Note that, according to the definition of the deterministic equivalent (see footnote 
3), in the expression of the deterministic equivalent 7 ^, the parameters N, K, L, as well as the 
matrix dimensions of Ri and R 2 , are finite. Combining Theorem 1 with the continuous mapping 
theorem"^, we have log (1 + 7 ^) — log (1 + 7 ^,) 0. An approximation -Rsum of the ergodic 

sum-rate -Rsum in (13) is obtained by replacing the instantaneous SINK 7 fc with its large system 
approximation 7 ^, that is, 

K 

Rsum = X] log (1 + 7fc) . (19) 

fc=l 

Therefore, when J\f ^ 00, (-Rsum — -Rsum) 0 holds true almost surely. 

To facilitate our understanding of Theorem 1, we look at it from the two special cases as 
follows: 

1) In Theorem 1, we introduce the two variables ti and ^2 to reflect the effects of the projection 
control parameter /). If /) = 1, from (16), then the deterministic equivalent 7 ^. does not 
depend on ^ 2 - Substituting 5 = 1 into (15) and letting Ri = Ik, we have 

picfil - C2){1 + C{p,p,a)f - C{p,r],aY) 

Ik = — - - - 72 -( 20 ) 

p+ (1 + C(/^,^,«)) 


^[29, Definition 6.1] (also see [36]): Consider a series of Hermitian random matrices Bi, B 2 , .. ., with Bjv G and 

a series /i, / 2 , ■ • ■ of functionals of 1 x 1, 2 x 2,.. . matrices. A deterministic equivalent of Bjv for functional /jv is a series 
B°, B 2 , ..., where B^ G , of deterministic matrices, such that limjv->oo /jv(Bjv) — 0- In this case, the 

convergence often be with probability one. Similarly, we term gjv = /Ar(B^) the deterministic equivalent of /jv(Biv), that is, 
the deterministic series gi,g 2 , ■ ■such that /jv(Bjv) — gfAr —>■ 0 in some sense. 

Note that the deterministic equivalent of the Hermitian random matrix Bjv is a deterministic and finite dimensional matrix 
B5,f. In addition, the deterministic equivalent of /jv(Biv) is gn — /jv(B^), which is a function of B^. 

"*[37, Theorem 25.7-Corollary 2]: If Xn a and h is continuous function at a, then h{xn) 222^ h{a). 
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where C{fi,ri,a) = ti/a, ^ = 1 — C 2 , and r] = 1/ci. Combining (16e) and (17), we obtain 

1 + 


C(/i,r7,«) = - = 7 ; 

a 2 


^ 1 ( fi-r] I - T])^ 2{fi + r]) 


a 


+ 


+ 1 




a 


( 21 ) 


Before providing an observation based on the above, we briefly review a well-known result 
from the large dimensional RMT. First, we eonsider the definition of H from (6). If Ri = 
\k, the entries of the K x N matrix H are zero mean i.i.d. with varianee 1/N. Following 
[29, Chapter 3], we see that as N, K ^ 00 with N/K —>■ ci, h.^ (H^H-|-alAr) ^ 
eonverges almost surely to 


A -f cr 


/(A) d\ 


where 


f[X) = {i-^)U{\) + 


V'(A-a)+(6-A)+ 


27rA 


( 22 ) 


(23) 


with (a;)'*' = max{a:, 0}, a = (1 — and b = {1 + In faet, f{u) is the limiting 

empirieal distribution of the eigenvalues of and is known as the Marccenko-Pastur 

law [38]. The integral of (22) ean be evaluated in elosed form 


1 

2 


\ a 




(1 -^) 
^,2 


2 


I 2(1+ n) 

a 



(24) 


Note that (21) is equal to (24) when /i is replaeed with 1, i.e., (24) is equal to C(l, "7, «)• In 
faet, following the similar derivations of Theorem 1, we ean show that (20) and the SINK 
of the eonventional RZF preeoding share the same formulation by replaeing ({n,r],a) in 
(20) with Substituting the definitions of ci,C 2 into /i and r], we have jj, — rf = 

1 — C 2 — 1/ci = (iV — (L + K))/N. Comparing this value with 1 — rj = {N — K)/N in 
(24), we thus eonelude that if /i = 1, the SINK of the PP-RZF preeoding is similar^ to 
that of the eonventional RZF preeoding but with an inerease in the number of aetive users 
from K to K + L. Henee, the degrees of freedom of the PP- RZ F preeoding is redueed to 
N — {K + L) beeause the additional L degrees of freedom are used to suppress interferenee 
to the PUs. 


^Notably, when (3 = 1, the SINRs of the PP-RZF precoding and the conventional RZF precoding are similar but not identical 
because C,{tL,r],Q) is replaced with (^{l,ri,a). 
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2) For another extreme case with /9 = 0 in Theorem 1, ti +t 2 = Letting Ri = 1^, we 
obtain = ((!,??,«), such that 



(25) 


where z/q = max{l, r2,i/6'/, / = is for Case I and uq = max{l, trR2/6'aii} is 

for Case II. The received interference power constraint at the PUs (4) can be controlled 
only through z/q, where /3 is not involved in uq. Therefore, the SINK 7 ^ is significantly 
degraded if the channel path gains between the BS and the PUs (that is, r 2 /s) are strong. 
However, if the channel path gains between the BS and the PUs are weak, then z^o = 1 
and 7 ^ behave in a manner similar to but not identical to that of the conventional RZF 
precoding because ci is replaced with ci(l — C 2 ). 

Comparing (25) for P = 0 with (20) for /3 = 1 obtains notable results. First, we note 
that (20) and (25) share a similar formulation, except the additional uq appears at the 
denominator of (25). When P = 1, the secondary BS yields zero interference on the 
PUs, such that the interference power constraint in (4) is always inactive. Therefore, no 
additional parameter z/q is required to reflect the received interference power constraint at 
the PUs. Although oq > 1, the SINR performance of the PP-RZF precoding with P = I 
is not implied to be always better than that with P = 0. An additional note should be 
given on ('(•, 77 , a), where the argument ■ is /r for = 1 and 1 for P = 0. The parameter 
/i = {N — L)/N for P = 1 implies that the additional L degrees of freedom is used to 
suppress interference to the PUs. Consequently, if the channel path gains between the BS 
and the PUs are weak, the SINR performance of the PP-RZF precoding with P = 1 shall 
not be better than that with P = 0. Thus, we infer that the projection control parameter 
should be decreased if the received interference power constraint at the PUs is relaxed. 
Finally, we note that C(l, 7 , a) agrees with .^(r, oq) in [10,11, Theorem 1]. As a result, (25) 
is identical to the deterministic equivalent for the SINR obtained in [10,11, Theorem 1], 
where the PP-RZF precoding with a single PU is considered. The deterministic equivalent 
for the SINR in [10,11, Theorem 1] is clearly a special case of (15) with /3 = 0 even 
though the case of /3 7 ^ 0 is considered in [10,11, Theorem 1] because a single PU results 
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only in one-dimensional perturbation, and the effeet of such perturbation vanishes in a 
large system. Even if the number of PUs L is finite and only N becomes large, the effect 
of 13 vanishes. The lack of a relation between jS and the SINRs will result in a bias when 
the number of antennas at the BS is not so large. However, our analytical results show the 
effect of (3 by assuming that N, K, and L are large, whereas Ci = N/K and C 2 = L/N 
are fixed ratios. Thus, our results are clearly more general than those in [10,11]. 

Corollary 1: In addition to the assumptions of Theorem 1, we suppose further that C 2 = 1 
(that is, N = L), Ri = ril^, and (3 G [0,1). Then, as AA ^ oo, we have 7 ^ — 7 0 for 

k = 1,... ,K, where 

^ _ p {cirl - {ciae - rif) 

7 ^ ^2 I ’ (2^) 

p(ciae)"' + Vo 


and e is given as an unique solution to the fixed point equation 


_ri(l + e(l - (3f) _ 

cia(l + e(l - j3Y) -f- ciri(l - /3)2’ 


and Vo = ri max{l, r2^i/9i, / = 1 ,..., L} for Case I or i/q = ''"1 max{l, trR 2 / 6 'aii} for Case II. 

Proof: By letting C 2 = 1 and Ri = rili^-, we immediately obtain the result from Theorems 
1 and 2 . ■ 

For a brief illustration, we consider only Case II of Corollary 1 because the same characteristics 
can be found in Case I. Given that ^aii = PaiiZ-Pr = Paii/(c’‘^p)> (26) can be rewritten as 


Cirf — (ciae — rif 


7 = < 


(CiQ!e)2 1/p 

Cirf — (ciae — 

^ (ciae)2-h cr2trR2/Paii’ 




Pall 

.2. 


1 < 


p(T^trR2 


(27) 


P 


all 


We can see that 7 does not depend on the SNR p when 1 < p(T^trR 2 /Paii. In this case, the system 
performance is interference-limited. Notably, the assumptions of C 2 = 1 and /3 7 ^ 1 are taken in 
Corollary 1. In the case of C 2 = 1 and (3 = 1, from (16a), we have = 0 and consequently 
= 0, which implies a failure in the transmission. This result is reasonable because when 
C 2 = 1, the dimension of the null space of F is zero with probability one.^ Therefore, the setting 


®If N = L, we have Rank(I -- F^(FF^) ^F) = 0 with probability one because from [39, Theorem 1.1], F is a full rank 
square matrix with probability one. 
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of /9 = 1 results in transmission failure, even when the channel path gains between the BS 
and the PUs are weak. We thus show that a choice of appropriate jS significantly affects the 
successful operation of the CR network, which serves as motivation for the remainder of this 
paper. 

B. Asymptotically Optimal Parameters 

Our numerical results confirm the high accuracy of the deterministic equivalent for the ergodic 
sum-rate i?sum in the next section. Therefore, the deterministic equivalent for the ergodic sum- 
rate can be used to determine the regularization parameter a and the projection control parameter 
(3. By replacing i?sum with i?sum in (14), we focus on this particular optimization to maximize 
the deterministic equivalent for the ergodic sum-rate 

= argmaxi?sum- (28) 

^ q>0,1>/3>0 

Similar to the problem in (14), the asymptotically optimal solutions and do not 
permit closed-form solutions. However, the asymptotically optimal solution can be computed 
efficiently via the following methods without the need for Monte-Carlo averaging because 
is deterministic. First, given that (3 is fixed, the optimal a°P*(/3) := argmaxQ,>o-Rsum(/9) can 
be obtained efficiently via one-dimensional line search [21,27], which performs the simple 
gradient method. The complexity in this part is linear. Then, we obtain the optimal := 
argmaxQ<^<^ /^) through the one-dimensional exhaustive search^. Finally, the 

optimal parameters are given by For a special case, we obtain a condition of 

the optimal solutions in the following proposition: 

Proposition 1: Under the assumptions of Corollary 1, the asymptotically optimal parameters 
a°P* and satisfy the equation 



pClTi 


where e [0, !)• 


^Although the one-dimensional exhaustive search seems burdensome, the case in question here is easy because the search is 
only over a closed set 0 < /? < 1. 
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Proof: By differentiating -Rgum with respeet to a and (3, we immediately obtain the result 
from Corollary 1. ■ 

From Proposition 1, we note that the number of asymptotieally optimal solutions is infinite. 
All a’s and /3’s that satisfy (29) are optimal. This eondition will be eonfirmed in the next seetion. 
Similar to (27), we eonsider Case II for brief illustration. In this ease, (29) ean be rewritten 


as 


' (1 - ^ pcj^trRa 

PCITI Pall 

(1 - ^’’*)^(j^trR2 ^ ^ pcr^trR2 

. CiriPall ’ Pall 


(30) 


From (27), when 0 < pcr^trR 2 /Paii < 1, the system performanee is unaffeeted by the average 
reeeived interferenee power eonstraint. In this ease, is expeeted to be elose to 0 beeause the 
weak interferenee at all the PUs is negligible. This eondition is eombined with the first term of 
(30) to reveal that deereases with inereasing p, where p = Pr/cr^ is the same as previously 
defined. However, when p(T^trR 2 /Paii > 1, the system performanee is limited by the average 
received interference power constraint. To decrease the interference, (3°^^ is expected to be close 
to 1. Therefore, the second term of (30) reveals that a decreases to 0 with an increase in 
We end this section by observing two additional extreme cases in Theorem 1 for Ri = li^-: 
If /3 = 0, by means of some algebraic manipulations, we obtain cpP* = z/o/(cip). By contrast, 
if /3 = 1 and C 2 7 ^ 1, we obtain cpP* = l/(cip). We find that the optimal regularization 
parameter tends to decrease monotonically with increasing p, as expected. This characteristic is 
similar to that of the conventional RZF precoding in [18,19], where ri = 1 is assumed and the 
asymptotically optimal regularization parameter = l/(cip) is derived. 


IV. Simulations 

In this section, we conduct simulations to confirm our analytical results. First, we compare 
the analytical results (19) in Theorem 1 and the Monte-Carlo simulation results (13) obtained 
from averaging over a large number of i.i.d. Rayleigh fading channels. In the simulations, we 
set channel path gains ri fc = 1 and = 0.6 for all k and I and assume that Pi = P for all I 
in Case I and Pan = LP in Case II. Several characteristics of Cases I and II are similar. Thus, 
without loss of generality, we provide the numerical results of Case I only. 
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Fig. 2. Ergodic sum-rate and the deterministic equivalent results under different interference power threshold and and two 
different antenna configuration cases. 


Fig. 2 compares the ergodic sum-rate and its deterministie equivalent result under different 
interferenee power thresholds P E {— lOdB, OdB} and two different antenna configuration cases: 
{N = 10, K = 8, L = 6} and {N = 16, K = 8, L = 6}. In the simulation, 
is obtained by using the two-dimensional line seareh in (14). We find that the deterministie 
equivalent is aeeurate under various settings even for systems with a not-so-large number of 
antennas. In addition, Fig. 2 illustrates that for the ease with {N = 10, K = 8, L = 6}, the 
sum-rate of the SUs eannot inerease linearly in SNR and beeomes interferenee-limited beeause 
the sum-rate of the SUs is easily restrieted by the average received interference power at each 
PU, particularly when the number of aetive users is larger than the number of antennas at the 
BS, that is, L + K > N. 

In the above simulations, the best solutions of {a°P\/3°P*} are calculated by Monte-Carlo 
averaging over 10^ independent trials; doing so whieh elearly results in a high eomputational 
eost. To eonfirm that the optimization based on the deterministie equivalent is not only more 
eomputationally effieient but also near-optimal, we eompare the ergodic sum-rate of the PP-RZF 
preeoding with P = OdB and {N = 16, K = 8, L = 6} in Fig. 3 for the following four eases: 1) 
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Fig. 3. Ergodic sum-rate results under various parameters with P = OdB and {A^ = 16, K = 8, L — 6}. 


2) {a°Pt, /3°P*}, 3) {a°P*, P = 0}, and 4) {a°P*, P = 1}. The solution of {a“P*, 
is obtained by using the two-dimensional line seareh in (28). {a°P*,^^*} provides results that are 
indistinguishable from those achieved by {a°P*,/3 °p*^}, which demonstrates that the optimization 
based on the deterministic equivalent is promising. Moreover, the performance is significantly 
improved if the PP-RZF precoding with an appropriate choice of {a,P} is employed. In the 
low-SNR regime, the optimal transmission becomes the conventional RZF precoding, whereas 
the optimal transmission is the PP- RZ F precoding with /) = 1 in the high-SNR regime. 

To provide further results on the optimal solutions of {«,/)}, Figs. 4 and 5 show the values of 
{a°P*,/9°P*} under various settings. We have observed that the optimal parameter 
based on the deterministic equivalent result is almost consistent with {a°P*,/9°P*} 
based on the ergodic sum-rate. Moreover, we have observed that with increasing p, «°p* (or cT’p*) 
tends to monotonically decrease to 0, whereas /3 °p* (or P°^^) tends to monotonically increase 
from 0 to 1. These characteristics are expected based on the analysis in Section III. 

Finally, we confirm the result in Proposition 1. Fig. 6 displays the ergodic sum-rate under 
various parameter settings with P = OdB and {N = 10, K = 8, L = 10}. We find that when 
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Fig. 4. Optimal a under different the interference power threshold for =16, K = 8, L = 6}. 



Fig. 5. Optimal under different interference power threshold for {A^ = 16, K — 8, L = 6}. 
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Fig. 6. Ergodic sum-rate results under various parameters for P = OdB and {A^ = 10, K = 8, L — 10}. 


C 2 = 1, the parameters that satisfy (29) ean aehieve the asymptotieally optimal sum-rate for any 
P G [0,1), sueh that infinitely many asymptotieally optimal solutions exist. 

V. Conclusion 

By exploiting the reeent advaneements in large dimensional RMT, we investigated downlink 
multiuser CR networks that eonsist of multiple SUs and multiple PUs. The deterministic equiv¬ 
alent of the ergodic sum-rate based on the PP-RZF precoding was derived. Numerical results 
revealed that the deterministic equivalent sum-rate provides reliable performance predictions even 
for systems with a not-so-large number of antennas. We thus used the deterministic equivalent 
result to identify the asymptotically optimal regularization parameter and the asymptotically op¬ 
timal projection control parameter. In addition, we provided the condition that the regularization 
parameter and the projection control parameter are asymptotically optimal. Several insights have 
been gained into the optimal PP-RZF precoding design. A natural extension of this is to consider 
the PP- RZ F precoding under various scenarios, such as spatial correlations and imperfect CSI at 
the transmitter. However, such development is still ongoing because of mathematical difficulties. 
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Appendix A: Proof of Theorem 1 

To complete this proof, we first introduce the limiting distribution for a new class of random 
Hermitian matrix in Theorem 2. Such distribution serves as the mathematieal basis for the latter 
derivation. We recall the definition of the Stieltjes transform (see, e.g., [40]). For a Hermitian 
matrix Bat G the Stieltjes transform of Bat, is defined as 

= ^tr (Bn + alN)~^ for a G M+. 

For ease of explanation, we also define the matrix produet Stieltjes transform of Bat as 

?7ibjv.q(«) = , 

where Q is any matrix with bounded speetrum norm (with respeet to N). 

Notably, both and mBjv,Q(«) are functions of a, but for ease of notation, a is dropped. 

In addition, all the subsequent approximations will be performed under the limit Af ^ oo, and 
for ease of expression, a x 6 denotes that a — b 0 as Af ^ oo. 

Theorem 2: Consider an x matrix of the following form: 

Bjv = H^H = (Iat -/3W^W)H^RiH(ljv -/3W^W), (31) 

where W, H, and Ri follow the restrietions given by Assumption 1. Then, as AA —)■ cx), we 
have 

tl+t2 1 . 

^Bjv.Q - - 

a iV 

where ti = 4^ and ^2 = i_|_e(‘f _^)2 with e being the unique solution to the fixed point equation 

e =^trRi (ali^ + (fi + ^ 2(1 -/3)^) Ri) (33) 

Proof: If C 2 = 1 (i.e., N = L), W^W = I^, the result is direetly obtained by Lemma 4 
(see Appendix C). 

We eonsider the ease with C 2 < 1. Given that ttibjv.q is a funetion of two random matriees W 
and H, we aim to derive an iterative deterministie equivalent [41] of mB^,Q. In particular, we 
first find a function ^Af(W, a), such that /Ar((H, W), a) x ^Ar(W, a), where /Ar((H, W), a) = 
^Af(W, a) is a funetion of W and is independent of {H}Ar>i. Notably, ^Af(W, a) is 
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a deterministic equivalent of /Ar((H, W), a) with respect to random matrix sequences {H} 7 v>i- 
Second, we further find a function QNict), such that ^Ar(W, a) x gN{oi). Thus, we obtain an 
iterative deterministic equivalent gN{oi) of /Ar((H, W), a), i.e., /Ar((H, W), a) x gN{oi). 

When W is treated as a deterministic matrix, applying Lemma 4 (see Appendix C), we have 

^trQ (B^ + alAr)-' x ^trQ + ae(l^ - , (34) 

where 


e = (alii- + eRi) ^, (35) 

e = ^tr(liv - /3W^W)' (liv + e(liv - . (36) 

Notice the fact that (W^W)^ = W^W so (34) and (36) can be written respectively as 

itrQ (al„ + ae(lA, - /3W«W)")”' = + W«W)-‘, (37) 


and 

' ^ + W"W)-‘w„ (38) 


where uj ^ and w/ denotes the /—th row of W. 

Next, we aim to derive the deterministic equivalents of the terms + W^W)“^ 

and w/^ (celAT + W^W)“^w;. Applying a result of the Haar matrix in Lemma 5 (see Appendix 
C) to (37) and combing (34), we immediately get (32). Then, we deal with the deterministic 
equivalent of (^ujlisf + W^W) ^w;. According to the matrix inverse lemma (see, e.g., [42, 
Lemma 2.1]*), we find 


wf(a;Ijv +W^W) 


wf(a;Iiv + WgW[z]) ^wi 
l + wf(a;Iiv + WjjW[z])-'w; 


(39) 


where W[;] = [wi,..., wj_i, w;+i,..., e i)xAf xhen, the trace lemma for isometric 


*[42, Lemma 2.1]: For any A G ' 


and q G C" with A and A + qq^ invertible, we have 
-1 1 


H t A , H 

q A + qq 


1 + q^A-iq 


H A -1 

33 —q A 
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matrices [43,44] gives us 

wf{ulN + Wj] W[;])”Vi ^ ~ Wj|W[;]) (uIn + Wj|W[z])”^ 

+ WhW|„)-‘ - (40) 

Now, applying [42, Lemma 2.2] and (72) to (40), we get 

wNuIn + W^Wy^wi X (41) 

ce + 1 

Substituting (41) into (38) and using (72) and (35), we obtain (33). ■ 

Note that mBjv,Q, e, ti, and t 2 are all functions of a and /3, but for ease of expression, a and 
f5 are dropped. 

Theorem 2 indieates that mB.q can be approximated by its deterministic equivalent 
without knowing the actual realization of channel random components. The deterministic equiva¬ 
lent is analytical and is much easier to compute than EB{mB,Q}, which requires time-consuming 
Monte-Carlo simulations. Motivated by this result in the large system limit, we aim to derive 
the deterministic equivalent of 

The SINK in (10) consists of three terms: (i) the signal power |h|^(H^H -f QT 7 v)“^hA:p, 
(ii) the interference power h|^(H^H -f QTAr)"^H^jP[fc]H[fc](H-^H -f QTAr)“^hfc, and (iii) the 
noise power u. Using Theorem 2, we establish the following three lemmas to derive the deter¬ 
ministic equivalent of each term, whose proofs are detailed in Appendices B-I, B-II, and B-III, 
successively. 

Lemma 1: Under the assumption of Theorem 2, as AA ^ cx), we have 

hf (H^H + aI^)"'hfcXafc, (42) 

where has been obtained by (16a). 

Lemma 2: Under the assumption of Theorem 2, as AA —?■ oo, we have 

hf (H^H + (H^H + x h, (43) 

where bk has been obtained by (16b). 
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Lemma 3: Under the assumption of Theorem 2, as AA —?■ oo, we have 

z/ X Z7, (44) 

where V ean be obtained by (16e) for Case I and by (18) for Case If. 

Aeeording to Lemma 1, Lemma 2, and Lemma 3, we obtain the deterministie equivalent 7 ^ 
of in (15). The proof is then eompleted. 


Appendix B: Proofs of Lemma 1, Lemma 2, and Lemma 3 
B-I: Proof of Lemma 1 

We start from an applieation of the matrix inverse lemma [42, Lemma 2.1] to the signal term, 


whieh results in 


h" (H"H + oIk) ' hi = 


l + hf(H»H[i, + ol7-‘hi' 


(45) 


Using [42, Lemma 2.3 and Lemma 2.2], we obtain 


hf + «lAr) 'hfcXri,,ltr(H^H + «I^) ' - ri,fc/3^trW^W (H^H + «lAr) 


-1 


'N 


N 


(46) 


Similarly, 


xr,,i2tr (H"H + al^)'' + - 2/3)2trW"W (H"H + al„)"‘. (47) 

Aeeording to Theorem 2, we have 

Itr (H^H + X (48) 

N ^ 'a 

Notieing that W^W = W^W and by using the same approaeh as (38), we obtain 

ltrW«W(H''H + x (49) 
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Substituting (48) and (49) into (46) and (47), we obtain 

\,H (uHyt I T ^ (^1 + ^ 2(1 — /5)) 

UH/TT/fu I T )“^V, ^ ■'"l.fc (^1 + ^ 2(1 — /5)^) 

(-n-[fc]bl[fc] + aiAf) hfc - - - -■ 


Consequently, the expression of (45), together with (50) and (51), yields (42). 


(50) 

(51) 


B-II: Proof of Lemma 2 

Using the faet that — D“^ = —A“^(A — D)D“^, we have 

hf (H^H + (H^H + crl^)”' 

=hf (H^H + hfc - «hf (HgH[fc] + (H^H + 

- hf (H^H + hfchf (H^H + 

+ ahf (H^H + aljv)”' h^hf (HgH[,] + aljv)”' (H^H + h^. (52) 


Applying the matrix inverse lemma, we obtain 


h^(H^H + alA,) 'hfc 
=hf (HgH[fc] + al7 v)"^hfc 


Similarly, 


hf(Hg,H [fc] + al^) 

l + hf(H»H[,| + aI„)-‘h» 


(53) 


hf(HgH[t, + aI„) ‘(H''H + aI„) ‘hi 


=h" (HgH|t| + aIjv)""ln 


h"(H"Hn.i +aI«)-^hthf(H"H,ti +al„)~'ht 
l + h?(HS%| + alA,)"‘h» 


(54) 


and 



(H|H[fc]+«I^) '(H^H + «I^) 'h, 


+ alAr) hfc — 


hf (HgjH;,] + + crl^)-^h 

l + hf(HgH[fc] + alA()-^h. 


(55) 
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According to Theorem 2, we have 

hf (HgH[fc] + al^)-'h, X (56) 

Noticing that 

d 

hf (HgH[fc] + al^)"^hfc = -^hf (HgH[fc] + al^)”^ h^, 

we thus obtain 

h« (h 5!,H[„ + alA-) h, X -n,, A , (57) 

Similarly, eombining (50) and (51) yields 

hf X -r.,,A (h^Sl^Dh'^ . (58) 

h" (HgHm + al„)"" hj X -r,,» A . ( 59 ) 

Substituting (50), (51), (56), (57), (58), and (59) into (53), (54), and (55), and eombining (42) 
and (52), we obtain (43). 

B-lIl: Proof of Lemma 3 
From (11), we first have 

^tr (H^H + H^H (H^H + aljv)"^ 

= ^tr (H^H + aljv)”^ - a^tr (H^H + ctItv)”^ , (60) 

whieh, together with Theorem 2, yields 

Itr (H^H + H^H (H^H + x ^ (61) 

For Case I, we have 

(H^H + aljv) H^H (H^H + al^) f, 

=f;^ (H^H + aljv)”' f« - (H^H + f^, 


( 62 ) 
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where / = 1,..., L. From (34) and (37), we obtain 

f,« (H"H + aI^,) f, X (xl« + W« W) . (63) 

Notieing the fact that W = (FF^)“5F, by using the matrix inversion formula^, we obtain 


(celjv + W^w) ^ 


=trf;f;^ - ce-^F^ (FF^ + a;-^FF^) ^ Fce”^) 


cu + 1 




H 


^2,1 
UJ +1 


(64) 


As a result, 


(H^H + ctIat) (65) 

C2a 

Combining this with (62), we obtain 

(H^H + aljv)"' H^H (H^H + aljv)"' f/ x (66) 

C2 oa 

From (61) and (66), the proof of (16c) can be accomplished using 

^tr(H^H + al7v)"^H^H(H^H + 

xE|^tr(H^H + aIjv)”^11^6(11^11+ al7v)"^| , (67a) 

f^^(H^H + aI^)”^H^H(H^H + al7v)"^fi 
xE |f,^(H^H + alAr)”^H^H(H^H + aljv)”^fz} • (67b) 


By using the martingale approach, we can prove (67) (See [45] for a similar application). 
Similarly, for Case II, we have 

E |trF(H^H + aIjv)”^H^H(H^H + aIjv)”^F^| x (68) 

I ^ ^ ^ ^ ) C2 oa 

Therefore, we obtain (18). 


Appendix C: Related Lemmas 

In this appendix, we provide some lemmas needed in the proof of Appendix A. 


^For invertible A, B and R matrices, suppose that B = A + XRY, thenB^i = A'^ - A^iX(R'i + YA-iX)-iYA~f 
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Lemma 4: Let X = [-^Xij] e where Xj/s are i.i.d. with zero mean, unit varianee 

and finite 4-th order moment. In addition, let Q G T G and R G be 

nonnegative definite matriees with uniformly bounded speetral norm (with respeet to N, N, 
and K, respeetively). Consider an X x X matrix of the form B^r = T^XRX^T^. Define 
Cl = N/K. Then, as X, X — )■ cxo sueh that 0 < liminfivCi < limsup^vCi < oo, the following 
holds for any oj G M+: 

^trQ{BN + ujIn) ^ X ^trQ (cjIat + a;eT)"\ (69) 

where e is given as the unique solution to the fixed-point equations 

e = ^trR(a;Ix + eR)”\ 
e = ^trT(Ijv + eT)"^ 

Proof: As a speeial ease of [46, Theorem 1] or [21, Theorem 1], the result ean be obtained 
immediately. ■ 

Lemma 5: Let Q G be a nonnegative definite matrix with uniformly bounded speetral 

norm (with respeet to X) and W G be L < X rows of an X X X Haar-distributed 

unitary random matrix. Define C 2 = L/N. Then, as L,X —)■ oo sueh that 0 < liminfjvC 2 < 
limsup^C 2 < 1, the following holds for any u) G M+: 

4trq(W"W + i.,lK)''x (^ + LL£i)ltrQ, (70) 

X yCJ + 1 OJ J N 

Proof: Sinee W^W = 1^ for C 2 = 1 (i.e., X = L), (70) evidently holds. We assume C 2 < 1 
in the following proof. Firstly, we eonsider a speeial ease with Q = I. Using the identity of the 
Stieltjes transform [29, Lemma 3.1] we have 

—tr(W^W + oj1n)~^ = -tr(WW^ + oj1l)~^ + (71) 

X ^ ' L ^ OJ 

*°[29, Lemma 3.1]: Let A G jj £ f^nxN^ Hermitian. Then, for « G C\R 

n , , , , N — n 1 

— mBA{z) = mAs(z) + 


N z' 
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Notice that the rows of W are orthogonal and henee WW^ = 1^. Therefore, 


N 


tr(W^W + a;I^) 


-1 r A C2 1 — C2 

' 0 = -r H-• 

00 


(72) 


00+1 

Next, for any nonnegative definite matrix with uniformly bounded spectral norm (with respect 
to iV) Q, we have 

ltrQ(W^W + 

= {1 - 6u)—trQ{W^W + uIn) ^ - 5 ^ wf Q(W^W + ccItv) (73) 


1=1 


where the first equality follows from the resolvent identity: = A“^(B — A)B~^ for 

invertible matriees A and B. Using the matrix inverse lemma [42, Lemma 2.1], the traee lemma 
for isometrie matriees [43,44], and the faet that Q has uniformly bounded speetral norm (with 
respeet to N), we obtain 


wf Q (W^ W + ccljv) 




1=1 


'C2 


^l + wf(W|W[;]+a;I^) 

(1 + cu)AtrQ(W^W + u 1 n )~' - ^trQ 


(l+cc)itr(W^^W + a;I^) - C2 

Substituting (74) into (73), and combining (72), yields 


(74) 


(l + a;)(5(^^trQ(W^W + u;l7v) - Sj^trQ 

(1 + u)6 - C2 


(75) 


Therefore, we get (70). 
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